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STABLE PROPAGATION OF MODIFIED FULL 
LENGTH CYSTIC FIBROSIS TRANSMEMBRANE 
CONDUCTANCE REGULATOR PROTEIN 
CDNA IN HETEROLOGOUS SYSTEMS 

5 FIELD OF THE INVENTION 

This invention relates to modifications in the cDNA 
of full length cystic fibrosis transmembrane conductance 
regulator protein (CFTR) which facilitates propagation 
and/or expression in heterologous systems. 

10 BACKGROUND OF THE INVENTION 

Cystic fibrosis (CF) is the most common, life- 
threatening, autosomal recessive disease in the Caucasian 
population. Approximately 1 in 2,500 live births is 
affected by this genetic disorder. Obstructive lung 

15 disease, pancreatic enzyme insufficiency and elevated 
sweat electrolytes are the hallmarks for CF but the 
severity of these symptoms vary from patient to patient. 
Patients with CF usually die at an early age due to lung 
infection. With recent advances in clinical treatments, 

20 which are directed against the symptoms, the mean 

survival age for patients has increased to 26 years. 

Despite intensive research efforts for the past 
fifty years, the basic defect in CF remains to be 
speculative. It is generally believed that the heavy 
25 mucus found in the respiratory tracts and the blockage of 
exocflhe secretion from the pancreas are due to imbalance 
in water secretion which is the consequence of a defect 
in the regulation of ion transport in the epithelial 
cells. 

m 

3<P The precise localisation of the CF locus on the long 

** arm of chromosome 7, region g31, facilitated the recent 
isolation of th* responsible gene. The C* gene spans 250 
kilobase pairs (kb) of DNA and encode a m*NA of about 
6,500 nucleotides in length. The CFTR gene is disclosed 
35 and claimed in United States application S.N. 401,609 
. filed August 31, 1989. That application is co-owned by 
the applicant of this application. 



Expression of this gene could be observed in a 
variety of tissues that are affected in CF patients, for 
example, lung, pancreas, liver, sweat gland and nasal 
epithelia. An open reading frame spanning 1480 amino 
acids could be deduced from the overlapping cDNA clones 
isolated. The putative protein as noted is called 
"Cystic Fibrosis Transmembrane Conductance Regulator" or 
CFTR for short, to reflect its possible role in the 
cells. The predicted molecular mass of CFTR is about 
170,000. 

Based on sequence alignment with other proteins of 
known functions, CFTR is thought to be a membrane- 
spanning protein which can function as a cyclic AMP- . 
regulated chloride channel . The internal sequence 
identity between the first and second half of CFTR 
resembles the other p r oka ry otic and eukaryotic transport 
proteins, most notably, the mammalian P-glycoprotein. 

The most frequent mutant allele of the CF gene 
involves a three base pair (bp) deletion which results in 
the deletion of a single amino acid residue 
(phenylalanine) at position 508, within the first ATP- 
binding domain of the predicted polypeptide. Although 
this mutation (AF508) accounts for about 70% of all CF. 
chromosomes, there is marked difference in its proportion 
among different populations. The remaining 30% of 
mutations in the CF gene appear to be heterogeneous and 
most of them are rare, with some represented by only 
single examples, as referenced in applicant's Canadian 
patent application filed July 9, 1990. 

The mutation screening study, con f^m s ^hat '"fne «bJ# 
binding domains detected by sequence aaignmen7j.s 
important for CFTR function as multiple, different 
mutations have been found for many of the highly 
conserved amino acid residues in these regions. The 
locations of the various mutations, also identified other 
functionally important regions in CFTR. There is, ""for 
example, a section three bp deletion resulting in the 
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omission of an isoleucine residue at position 506 or 507 
of the putative protein. While amino acid substitutions 
at these positions are apparently not disease-causing, 
this observation argues that the length of the peptide is 
5 more critical than the actual amino acid residue in the 
506-508 region. Further, the existence of a large number 
of nonsense, frameshift as well as mRNA splicing 
mutations in the CF gene implies that absence of CFTR is 
S'[ not incompatible with life. 

10 • The varied symptoms among difjffrent CP* g£tictots 

suggest that disease severity is at least in part related 
to the mutations in the CF gene. Such association, which 
is expected to be concordant among patients within the 
same family, as they should have the same genotype at the 

15 CF locus, is observed for pancreatic function. 

Approximately 85% of CF patients are severely deficient 
in pancreatic enzyme secretion, thus diagnosed as 
pancreatic insufficient (PI), and the other 15% have 
sufficient enzyme, thus pancreatic sufficient (PS) . 

20 Family studies showed that there was almost complete 
concordance of the pancreatic status among patients 
within the same family, leading to the suggestion that PI 
and PS are predisposed by the patients 9 genotypes. 
Subsequent studies showed that patients homozygous for 

25 the AF508 mutation were almost exclusively PI. This 
information may be useful in disease prognosis* 

There are other mutations that would be classified 
in the same group as AF508, the so-called severe mutant 
alleles with respect to pancreatic function. In 

30 contrast, patients with one or two copies of other class 
(i.e. mild) of alleles are expected to be ES. Meconium 
ileus which is observed in about 30% of CF patients 
appears to be a clinical variation of PI and not directly 
determined by the CF genotype. Other clinical 

35 manifestations are more complicated and no apparent 
association has yet been detected. 
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With the identification of the CF gene, a better 
understanding of the basic defect and pathophysiology of 
the disease can now be attained. Progress and advance 
are being made in studies of the regulatory mechanisms 
5 governing the expression of this gene, and of the 

biosynthesis and subcellular localization of the protein 
(through generation of antibodies against various parts 
of the protein) . In addition, it is important to develop 
effective assay systems for the function of CFTR. This 

10 information may be useful in development of rational 
therapies, including gene therapy. 

In order to obtain a DNA sequence containing the 
entire coding region of CFTR, it is necessary to 
construct a full-length cDNA from overlapping clones 

15 previously isolated. A major difficulty has been 

encountered in the process, however. As the various 
proportions of the full-length cDNA is being linked 
together by standard procedures; i.e., restriction enzyme 
cutting and ligation, with plasmid vector in E»cwtc*t« 

20 con, frequent sequence rearrangement has been detected in 
the resulting construct. 

For purposes of better understanding of the 
regulatory functions of- the CFTR protein and also for 
purposes of gene and drug therapy, it is useful to be 

25 able, in a commercial way, to propagate and express the 
normal CFTR gene and various mutant CFTR genes in a 
variety of hosts which include bacteria, yeast, molds, 
plant and animal cells and the like. 

Although propagation and expression of the cDNA 

30 sequence for the CFTR gene can be achieved in some 
vehicles, there are, however, the aforementioned 
difficulties in obtaining stable propagation of the cONA 
in some types of bacteria, particularly e. coil. it is 
thought that the cDNA contains sequence portions which, 

35 when propagated in the bacteria, results in a toxic 

effect which is countered by lack of propagation of the 
cDNA in the microorganism. 
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INVENTION SUMMARY 

We have discovered that a modification of a cDNA 
repeat sequence in exon 6 of the CFTR gene without 
modifying the amino acids encoded by the changed codons 
5 facilitates propagation and/or expression of the CFTR 
protein in living cells and in particular bacterial 
cells. 

According to an aspect of the present invention, a 
modified DNA sequence derived from a gene coding for 
10 cystic fibrosis transmembrane conductance regulator 
(CFTR) protein, the gene having at least 27 exons of 
which normal cDNA codes for CFTR protein, the normal cDNA 
including exons 6a and 6b wherein exon 6b includes a 13 
bp repeat, 

15 the modified DNA sequence comprises at least one of 

the 13 bp repeats of exon 6b having one or more normal 
nucleotides of the 13 bp repeat substituted with an 
alternate nucleotide which continues to code for a 
corresponding normal amino acid. 

20 According to another aspect of the present 

invention, a DNA construct for use in a recombinant 
vector comprises the modified cDNA. 

According to a another aspect of the present 
invention, a vector which comprises the DNA construct 

25 and a promoter sequence for the DNA. construct. 

• Apcording to futher aspect of the present invention, 
a host cell for producing CFTR protein, the host cell 
comprises the above vector whereby expression of the 
vector in the host cell produces CFTR protein. 

30 According to another aspect of the present 

invention, a CFTR protein isolated and purified from 
culture of the host cell. 

According to a further aspect of the present 
invention, a mammalian cell trams feet ed with the above 

35 vector to enhance CI" conductance through a cell wall. 
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DESCRIPTION OP DRAWINGS 

Figure 1 is the nucleotide sequence of the CF gene 
and the amino acid sequence of the CFTR protein; 

Figure 2 is a schematic diagram of the CF gene and 
5 its protein product. (A) Gene structure with exons 

represented by open boxes; (B) Computer-predicted primary 
structure of CFTR; and (C) Model of CFTR showing possible 
relationship with cellular membrane and potential 
glycosylation site (-CHO); 

10 Figure 3 is a DNA sequence of exon 6b. The 

nucleotide positions (877-1002) for exon 6 correspond to ' 
the previous cDNA numbering scheme (Riordan et al 1989). 
The 13 bp direct repeats are underlined. The consensus 
prokaryotic transcription signals (at positions -35 and - 

15 10) are included for comparison. The modified 

nucleotides in pCOF-1; pBQ6.2 and pBQ4.7 are shown 
underneath. 

Figure 4 is a schematic diagram shoving the three 
basic steps in the construction of full-length CFTR cDNA. 

20 Figure 5 is the generation of the 5' end of CFTR 

cDNA for the expression vector pCOF-1. Two overlapping, 
oppositely oriented oligonucleotides, Cofl (44-mer) and 
Cof2 (40-mer), were used in the generation of the exon 1 
sequence. A double stranded molecule was synthesized. 

25 from these two oligonucleotides with the Klenov fragment 
of DNA polymerase. An Ncol site was introduced at the 
initiation codon to facilitate subsequent cloning. The 
PCR-generated fragment was digested with Ncol and PvuII 
and llgated to the remaining portions of the CFTR cDNA, 

30 including the PvuII to Xbal fragment of cDNA clone 10-1, 
the PCR-modified middle fragment and the 3 • fragment . 
The procedures used to generate the two latter segments 
are described with reference to Figures 6 and 7. The 
position of the human metallothionein IIA promoter, the 

35 bacterial ampicillin-resistance gene, the pBR origin of 
replication and the SV40 origin of replication are 
marked. 
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Figure 6 is 3 'end construction* The 3' coding 
region of the expression vector (pCOF-1) and pBQ6.2 
was constructed in two sequential cloning steps. First, 
the 3» most portion, including the end of the coding 
5 sequence and the 3 1 untranslated region, was prepared by 
ligating a fragment from T16-4.5 (generated by Bcol 
partial digestion) to a fragment from the genomic clone 
TE27 (Rommens et al 1989) . The resulting fragment was 
then joined with the Sphl-BstXI fragment from Cl-1/5 

10 (riordan et al 1989) at the BsXI restriction site to 
yield a 4 kb 3' end fragment. 

Figure 7 is a schematic diagram showing the 
synthesis of the middle segment containing the modified 
sequence. Details are described herein. 

15 Figure 8 is propagation vectors pBQ4.7 and pBQ6.2. 

The construction of these plasmids are provided in 
description of the invention. Key restriction sites are 
marked. The sires of the transcripts generated by the T7 
RNA polymerase (from the PstI site to each of the marked 

20 restriction sites ) are also Indicated (in kilobases) . 
Figure 9 is DNA sequencing ladder shoving the 
modified sequence in exon 6b. The sequencing reaction 
was performed with the use of an oligonucleotide primer 
(19-mer) corresponding to a sequence in exon 6a starting 

25 at nucleotide 771 (5 1 H3CTAATCTGGGAGTTGTT-3 • ) • The 
altered nucleotide in the sequence are underlined. 

Figure 10 is in vitro translation of CFTR. The 
products from in vitro translation reactions were 
separated oxt a 10% polyacrylamide -SDS gel. The gel was 

30 stained by Coomassie Blue after electrophoresis; it was 
then soaked in Enhance 0 (New England Nuclear) and dried 
at 60 "C under vacuum b .f ore exposure to X-omat film 
( Kodak K L&n* 1: no added RNA; lane 2: protein 
translated from the Brome Mosaic Virus RBA (as size and 

35 reaction conditions control); lanes 3 and 4: duplicate 
samples of cappfed mRNA prepared by T7 RNA polymerase on 
template pQB6.2 digested with Hpal. The positions for 



WO 92/05252 



PCI7CA91/00341 



the protein molecular weight standards (purchased from 
BRL Labs) are indicated on the left in kilodaltons. The 
expected product of 8 2. 6 kb (from the RNA generated from 
the pBQ6.2 plasmid) is marked. Also marked (by arrows) 
5 are the sizes of the BMC protein products. 

Figure 11 is DNA hybridization analysis of 
integrated plasmids. Genomic DNA samples were prepared 
from each cell line, digested with BamHI and separated on 
a 0.7% agarose gel in Tris-acetate buffer. The DNA lanes 
10 were transferred to Zetabind (Bio-Rad Labs) membrane. 
The autoradiograph shows the hybridization result with 
the CFTR cDNA probe. Lanes 1 and 2 contain human and 
mouse genomic DNA, respectively. Lanes 3, 4, 25 and 26 
contain non-transfected mouse LTK-cell DNA. Lanes 5 to 9 
15 contain DNA from HAT-resistance clones obtained by 

transf action with pSTK7 only. Lanes 10 to 18 contain DNA 
from HAT resistant clones obtained by co-trans faction 
with pSTK7 and pCOF-l and lanes 19 to 24 with pSTK7 and " 
pCONZ. Molecular weight markers are shown on the left 
20 (in kb). The two diagnostic bands are 2.3 kb (the 5 'end 
of the CFTR gene plus the expression vector, promoter 
sequence) and 2.5 kb (CFTR internal segment); they are 
indicated on the right by the arrows. 

Figure 12 is rna hybridization analysis. Total rna 
25 was extracted from each cell line, purified through CsCl 
gradient centrifugation, and separated on a 1.0% 
formaldehyde agarose gel in HOPS-acetate buffer. The 
samples were transferred to Zetabind (Bio-Rad Labs) 
membrane and hybridized sequentially with radioactively 
30 labeled CFTR cDNA (panel A) and the TK cDNA probe (panel 
B). The samples in lanes 1 an* 21 jrere prepared from the 
colonic cell line T84; lane 2 from fresh mouse salivary 
tissue; and lane 3 from un trans fee tod LTK-cells. Lanes 4 
to 7 contain RNA prepared from independent clones 
35 generated from pSTK7 transf ection ; lanes 8 to 15 from 

cotransf action with pSTK7 and pCOF-l; and lanes 16 to. 20 
from co-transf ection with PSTK7 and pCONZ. The 286 and 
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186 rRNA bands are indicated. The position anticipated 
for the full-length transcript (6.2 kb) is also marked. 
The expected transcript for the HSV YK gene is 2.2 kb. 
Figure 13 is Western Blot analysis. Total protein 
5 extracts were prepared from the indicated cell lines, 
separated on a 7.5% polyacrylamide-SDS gel and 
transferred to nitrocellulose as described herein. Lane 
1: extract from line 2a-4A; lane 2: 4a-2D; lane 3:4a-3I; 
lane 4:6b-J. The blot was incubated with a mouse 

10 monoclonal antibody and followed by anti-mouse antibody. 
The immune complexes were visualized with alkaline 
phosphatase as described herein. Protein molecular 
weight standards are shown on the left. The arrow on the 
right marks the 170 kilodalton band. 

15 Figure 14 is expression vector pCOF-1. The complete 

CFTR coding region (open boxes) is positioned downstream 
from the human metal lothionein Ila (hMTIIa) promoter 
(hatched box) . The human metallothionein Ila initiation 
codon is joined with that of CFTR at an MCft I site 

20 introduced by the synthetic oligonucleotides. 

5 9 -CACTGCAGACCATGGAGAGCT 1 and 

5 1 -GACTGCAGCTGAAAAAAAGTTTGGAG^ 1 . The DMA 

sequence from exons 2 to 24 and its 3' flanking region 
was derived from clones 10-1 ( Pvu II-Xba in , T16-1 

25 I-SPh I ) , Cl-1/5 (SDh I-BstXll. T16B4.5 (BgtXI-NSQ I) , 
and TE87B2.3 f Nco I-Xho I ) , as indicated. In pC0F*F508, 
the 1-kilobase (kb) B«pTTT-ffPft a fragment was replaced 
with the corresponding fragment from clone Cl-1/5. 

Figure 15 is RNA and protein analysis of mouse L ^ 

30 cell lines expressing human CFTR. (A) Total RNA from 
Caco2 (Spg, lane 1), LTK" (10 fig, lane 2) , 6B-I (lfl^tg, 
lane 3), 4a-3I (10 ng, lane 4), and 5-2D (10 M9# lan* 5) 
cell lines were electrophoresed on a 1% 
formaldehyde/ agarose gel, transferred to Hybond-N 

35 (Amersham) , an hybridized with ^P-labelled cONA probe. 
The 6.2-kb CFTR mRNA of the Caco2 cell line is indicated 
with the arrow. The relative positions of the 28S and 



WO 92/05252 




PCT/CA91/00341 



10 



18S rRNAs as indicated (B) Protein fractions from nuclei 
and mitochondria (lanes A) and curde light membranes 
(lanes B) from the cell lines T84, LT1C, 6B-I, 4a-3I and 
5-2D. Bands were visualized by 125 I-labelled rabbit anti- 
5 mouse antibody. Molecular mass in kDa are indicated. 

Figure 16 is video microscopic detection of CI" 
permeability in single L cells, SPQ fluorescence 
intensities (F) are expressed reltive to SPQ fluorescence 
intensity in the absence of CI" quenching (F 0 ; the 

10 *- direction of changes in Fo/F reflect parallel changes in 
intracellular CI* concentration. (A-C) Single cells, each 
indicated by a different symbol, are shown containing a 
frameshift construct leading to a predicted truncated 
CFTR (A) ; a mutant construct CFTRaF508 (B) ; and an intact 

15 construct, CFTR (C) . Forskolin (10 jiM) and NO~ 3 medium 
were perfushed over the cells for the periods indicated 
by solid bars. Time scale in C applies to A-C. (D) A 
single CFTR cell was repeatedly pulsed with NO" 3 medium. 
Rate of fluorescence chage as well as peak response 

20 during constant pulses (1 min) was unchanged over the 

time course of exposure to forskolin, indicating that the 
cAMP- induced CI' permeability is sustained. (E) 
Simultaneous determinations of CI* concentrations (F«/F) 
(open and solid squares) and cell volume, expressed as 

25 relative area of constant optical section (open and solid 
circles), in. a single cell expressing CFTR* (solid squares 
and circles) ) and a single cell expressing truncated CFTR 
(open squares and circles) . As indicated, the cells were 
exposed to 5 fiK gramicidin and to 10 mm forskolin. 

30 Figure 17 is whole-cell CI" currents in traiuifected L 

cell. (A) Time course of whole-cell currents measured 
from cells transfected with the frame-shift CFTR control 
construct (solid diamonds) and the intact CFTR construct 
(open squares) after the addition of a solution 

35 containing 10 MM forskolin, l mM isobutylmethylxanthine, 
and 100 pM N*, C^'-dibutylyladenosine 3\5 ( -cyclic 
monophosphate. The arrow indicates the time of solution 



WO 92/05252 




PCT/CA91/00341 



11 

addition. Outward currents vera measured at Eg = +20 mV. 
(B) Whole cell current-voltage relationships for a CFTR- 
expressing cell before (Left) and 3 min after (Right) 
induced activation. The current scale for the 
5 nonstimulated cells is shown enlarged as indicated. The 
dashed line indicates the zero-current level, (c) Mean 
current-voltage relationships for CFTR cells (n - 10) 
before and after activation (solid circles and open 
squares, respectively). Replacement of bath NaCl with 
10 sodium gluconate resulted in a shift in reversal 

pretention of cAMP-activated currents (open triangle) 
(n - 4). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The normal CFTR gene sequence and protein sequence 

15 is shown in Figure 1. The sequence consists of cDNA 
identified and isolated irom chromosome of mammalian 
cells in accordance with the procedure identified in co- 
applicants United states patent application SN 401,609. 
It is believed that the cDNA of CFTR is made up of at 

20 least 27 axons as identified in applicant 1 s pending 
Canadian patent application filed July 9, 1990. 

The exact cause of the sequence instability is 
unknown, but the region frequently involved in this 
rearrangement is a 13 bp direct Repeat in exon 6a of the 

25 CFTR gene as shown in Figure 3. This region also shows 
sequence identity with consensus regulatory sequences in 
e. coif as noted in Figure 3. # it is possible that 
transcription initiates from this sequence, resulting in 
a product which is toxic to the bacterial host. It is 

30 also possible that the DNA sequence itself elicits an 

inhibitory response;* In consequence, the transformation 
frequency and growth of the desired plasmid construct 
become extremely poor. The rearrangement of this 
sequence detected in the recovered plasmids may have 

35 alleviated its toxicity in bacteria. Thus, the inability 
to obtain a full-length CFTR cDNA in the original cloning 
experiment (Rommens et al, 1989) may also be explained. 



WO 92/05252 




PCT/CA91/00341 



12 

Since most of the spontaneous sequence 
rearrangements detected disrupts the open reading frame 
of the full-length cdna, none of the resulting clones are 
expected to be useful in functional complementation 
5 studies. Although it is possible to use alternative 
host/vector systems to overcome this problem, the 
convenience of the e. coii/pBR322-based cloning system 
required a better system. 

The region of interest with respect to enhancing 

10 propagatibn of the cDNA in microorganisms and in 

particular in e. cou is in axon 6. A 13 base pair repeat 
has been found in exon 6. With reference to Figure 1, the 
region of exon 6 is identified. More specifically, 
within exon 6 the repeat sequences occur at positions 923 

15 through 935 and at 981 through 993. These repeat 

sequences are separated by approximately 44 base pairs. 

It has been discovered that by modifying one or more 
of the base pairs in region 924 through 936 or 982. 
through 994 without changing the amino acid encoded by 

20 the respective codon results quite surprisingly in the 

propagation of the cENA in a suitable microorganism, such 
as e. eon. The modified oDNA, in one of the repeat base 
pairs sections, does not produce a toxic effect in the 
microorganism so that a stable propagation of the cONA 

25 can proceed. The benefit in the stable propagation of 

cDHA is not only from the standpoint of expression of the 
gene to produce the protein, but also for producing 
multiple copies of the cDKA for the CFTR gene to enable 
other uses of the gene, such as in drug and gene therapy 

30 for correcting the effects of CP. 
Construction of full Length cDma 

Two types of full length cDNA clones have been 
constructed to evaluate the modified cDNA. One of them 
is for efficient propagation in e. coif and the other one 

35 for expression in mammalian cells. 

Since the entire, coding region of CFTR is present in 
multiple overlapping clones, it is necessary to construct 
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the full-length cDNA in three separate steps (Figure 4): 
construction of the 5* end of the cDNA (from the beginning 
of the coding region to the Xbal site in exon 5) , 
(Rommens et al, 1989) generation of the middle segment 
5 (from the XBal site to the SphI site in exon 10) 

including the critical region by the polymerase chain 
reaction (PCR: Saiki et al, 1985) , and (Riordan et al, 
1989) construction of the 3 'end of the cDNA from 
different existing clones (from SphI site to exon 24). 

10 The starting clones of this work have been described 
previously (Riordan et al, 1989). 

To construct the 5" end of the cDNA in the mammalian 
expression vector, oligonucleotides were used to 
synthesize the open reading frame between the initiation 

15 codon and the end of exon 1 (the PvuII site) . This 
fragment was inserted between the promotor of vector 
pSG3X and exon 2 of cDNA clone 10-1 (see Figure 5) . A 
single nucleotide modification (C to G) was introduced at 
the position immediately after the initiation codon to 

20 facilitate subsequent manipulation. The alteration also 
introduced an amino acid change from Glutamine to 
Glutamate. 

To construct the 3' end of the cDNA, DKA fragments .« 
were obtained from three existing clones (Figure 6) : SphI 

25 to BstXI (exons 10-20) from Cl-1/5 (Riordan et al, 1982), 
BstXI to Ncol (exons 20-24) from T16-4.5 (Riordan et al, 
1989), and Ncol to EcoRI (exon 24 to about 100 bp 
downstream from the polyadenylation signal) from TE27 
(Rommens et al, 1989). The genomic DNA fragment included 

30 at the end of this construct to ensure proper « 

transcription termination and mRNA processing. This 
3 'end construct was used for both the expression and 
propagation vectors. 

The cDNA clone T16-1 (Riordan et al, 1989) was used 

35 as the template for generation of the middle segment, 
which was used for both vector systems. To introduce a 
sequence modification in one of the 13 bp repeat region 
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of axon 6, the in vitro mutagenesis procedure based 
oligonucleotide-directed PCR (Higuchi at al, 1988; Ho at 
al, 1989) was used. Briefly, two overlapping segments, 
spanning axons 3 to 6 (536 bp) and axon 6 to 17 (867 bp) , 
5 were generated from T16-1 by PCR with two sets of 

oligonucleosides (sea Figure 7) . The two overlapping 
segments were joined by the PCR procedure with the 
outermost, flanking oligonucleotide primers. The 
product, 1.38 kb in size, was then digested with Xbal and 

10 SphI to generate the modified middle segment. 

The pBluescript* vector from Stratagene was used as 
the basis propagation vector. The 5 1 and of the full- 
length cDNA in this vector was derived from cDNA clone 
10-1 (between the PstI and Xbal sites) . The resulting 

15 clones ware named pBQ4.7 and pBQ6.2 (Figure 8). 

The plasmid pSGM3X was used as the basic mammalian 
expression vector. It is similar to p SGM1 previously 
reported by Meakin et al (1987), except that the human 
metallothionein promoter was inserted in an opposite 

20 orientation and that a Xhol site was inserted in the Kpnl 
site within the Ecogpt gene. The resulting expression 
vector was named pCOF-1 (Figure 5) . 
Sequence Verification 

Since errors are often introduced into PCR products ' 

25 due to infidelity of the Taq polymerase, the middle 
segment of the cDNA clones were f irylr verified by DMA 
sequencing. The procedure of didaoxy termination was 
used according to the DS Biochemical Sequenase* kit. 
Consequently, it was necessary to replace a small segment 

30 (the Xbal site in exon 6 to the Fspl site in exon if «of 
the clone with the corresponding segment from ^1601. 
Overlapping DNA sequencing was then performed for the 
entire open reading frame of each of the completed full- 
length cDNA constructs. 

35 la vitro Translation 

To produce large quantities of RMA from the 
propagation vector (pBQ4.7 and pBQ6.2) for the purpose of 
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in vitro translation, tha mCAP* mRNA capping kit from 
Stratagene was used. 

Since the open reading frame predicts a protein of 
170,999 kilodaltons (kd) , it would be difficult to 
5 produce a product of this size in vitro . It was 

therefore necessary to use differently prepared templates 
to examine various segments of the open reading frame. 
To prepare these templates, the propagation vector 
pBQ6.2) was digested with Fspl, EcoRI, Hpal and Xhol, 

10 accordingly. 

in vitro translation was performed with a rabbit 
reticulocyte lysate kit from Promega Corporation and U C- 
labelled leucine from Amersham. 
Trans f eotlon Studies in Moose LTJC Cells 

15 The calcium phosphate co-precipitation procedure for 

introduction of plasmid DNA into mouse LTK-cells has been 
described previously (Meakin et al, 1987). The plasmid 
pSTK7 was used for cotrans feet ion with the expression 
vector pCOF-1. As a control vector, a plasmid construct 

20 similar to pCOF-1, except for the deletion of a single 
base pair in exon 1 was used; this deletion was expected 
to result in premature termination of translation. 

The mouse LTK-cells were passaged in a-KEM medium 
supplemented with glutamine, 10% fetal bovine serum and 

25 antibiotics. Biochemical selection for TK positive cells^ 
was achieved in medium containing hypoxanthine, 
aminopterin and thymidine (HAT medium) . 
DMA and RXA Analyses 

Plasmid and DNA sa^oles were prepared from the 

30* bacterial cells and genomic DNA from trans ffeted mouse L 
cells. Total RNA was extracted from mammalian, cells for 
examination of gene expression. Standard procedures, 
essentially as described by Sambrook et al (1989), for 
DNA and RNA analyses were used. 

35 Protein Analysis 

Animal calls were harvested in TEN buffer (40 BH 
Tris-HCl, pH 7.5, 1 nN EDTA, 150 mM) , after the cells 
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were washed in phosphate-buffered saline. Cells were 
collected by scraping, resuspended in 2560 mM Tris-HCl 
(pH 8), and stored in -80 'C ready for total protein 
extraction. For SDS-polyacrylamide gel electrophoresis, 
5 the frozen cells were resuspended in the loading buffer 
and boiled for 5 min prior to electrophoresis according 
to the procedure of Laemmli et al (1970) . 

Transfer of protein from polyacryl amide gel to 
nitrocellulose was performed overnight at 0.4 mA 

10 according to the procedure of Towbin et al (1979) . The 
membrane was baked at 80 *C for 90 min. Detection of CFTR • 
was accomplished with the use- of a monoclonal antibody 
(gift of N. Kartner and J. Riordan) prepared against a 
portion of the predicted protein (amino acid residues 

15 347-698) and the ProtoBlot* AP System according to 
procedures recommended by the supplier (Pr omega 
Corporation) with 1% BAS as a blocking agent. 
Vector Construction 

Three plasmid constructs were made in this study - 

20 pBQ4.7; pBQ6.2 and pCOF-1 as described in the previous 

section. pBQ4.7 is a full-length cDNA clone contained in 
the Bluescript* vector; it contains the entire coding 
region except the 3» untranslated region. pBQ6.2 contains 
the same sequence as in pBW4.7 plus the 3' untranslated 

25 region. Both of these plasmids contain modification in 
the first 13 bp direct repeat in exon' 6b (Figure 3). The 
full-length cDHA construct in pCOF-l is inserted 
downstream from the human metal lothionein IIA promoter in 
the vector pSGM3X and the coding region is flanked by 

30 CFTR genomic DMA sequence at the 3» end. In addition to 
the modification introduced into the exon 6b region of 
the cDNA, as" in the two previous constructs, pCOF-1 has 
another modification (C to G) in the first nucleotide 
after the presumptive initiation (AXG)*codon in exon 1. 

35 As the first step in examining their integrity, 

restriction enzyme digestions were oised to derive a map 
for each of these full-length cDNA constructs. The 
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results showed that all of them produced a set of DNA 
fragments as expected. It is, however, of interest to 
note that there was one DNA fragment in each of the 
digestions which appeared to have a different mobility 
5 when compared to the pattern of the corresponding 

parental cDNA clones displayed on polyacrylamide gel. 
The mapping data located the fragments with the altered 
mobility to those containing the 13 bp direct repeats. 
To ensure the DNA sequence was intact as designed, 

10 the middle segment containing the modified segment was 
examined by DNA sequencing. The result showed that 3 bp 
were altered in the region. While two of the 
modifications (T to C at position 930 and A to 6 at 933) 
were inserted as expected, an additional change (T to C 

15 at position 936) was found (Figure 9). This 

unanticipated alteration was found in all five clones 
examined, suggesting that it was introduced by an error 
in the synthesis of the oligonucleotide primer. 
Nevertheless, this error was located outside of the "-35" 

20 prokaryotic consensus sequence (Hawley and NcClure 1983) 
and did not alter the encoded amino acid (Figure 3) . 

Two other undesired nucleotide changes were detected 
in the middle cDNA segment. Since these latter 
substitutions, probably introduced during PCR, would 

25 change the encoded* amino acids, it was necessary to 
replace the region containing these changes with a 
segment from the original plasmid. 

The DNA sequence in all three resulting clones 
appeared to be stable after long term propagation in e. 

30 coll. Theiif entire coding region was sequenced and no 
additional changes were detected. It was noted that the 
transformation efficiency of these constructs in bacteria 
increased substantially (100 times higher than the 
unmodified cDNA) and that the copy number of plasmids (as 

35 reflected by the yield in DNA preparation) also 
increased • 



WO 92/05252 




PCT/CA91/00341 



18 

Protein Synthesis by in vitro Translation 

To ensure that the open reading in the full-length 
cDNA was uninterrupted, the in vitro translation method 
was performed. Accordingly, the propagation vector was 
5 linearized at an appropriate restriction site in the 

circular plasmid and used as the template for production 
of RNA suitable for translation in vitro , since the 
entire CFTR protein would be probably too large to be 
translated In vitr°, the experiment also included shorter 

10 ' transcripts produced from templates interrupted within 
the coding region. 

An example of the result is shown in Figure 10. The 
template used for this experiment was pBQ6.2 linear! zed 
at the Hpal site; the in vitro translation product was 

15 expected to be 86.2 kd. As shown in Figure 10, a band 
migrating at the position of the expected molecular 
weight is clearly visible, indicating that translation 
initiates at the ATG codon as predicted and continues 
through the Hpal site in exon 13. The identity of this 

20 protein is also confirmed by its ability to react with a 
monoclonal antibody against CFTR. The other, smaller 
proteins observed in the products are probably results of 
premature termination or intenal initiation of 
translation. 

25 similar experiments were performed with the plasmid 

treated with Xbal and EcoRI. Protein bands corresponding 
to the expected translation products 19 and 77.5 kd, 
respectively, were readily detectable on the 
polymery lamide gel. These results, therefore, provided 

30 further confirmation that the predicted open reading 
frame was correct and intact in the full-length cDNA 
constructs. 

Expression in Mouse L Cells 

To examine if CFTR could be produced in heterologous 
35 mammalian cells, the pCOF-1 plasmid was used to transfect 
mouse LTK-cells. Another plasmid containing the herpes 
simplex virus TK gene was included in the transfection to 
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allow biochemical selection of cells that were 
successfully transfected. A plasmid (pCONZ) which 
suffered a single base pair deletion immediately adjacent 
to the 3' initiation codon was used in a parallel 
5 transfection experiment as a negative control (as the 
frameshift would result in no CFTR products) . 
DNA Analysis 

HAT-resistant L cell colonies were isolated and 
expanded into individual , mass cultures and genomic DNA 
10 were isolated from these cultures* for characterization of 
integrated plasmid DNA. As expected, all of the HAT- 
resistant L cell lines was found to contain an intact HSV 
TK gene (as demonstrated by gel -blot hybridization 
analysis) . In addition, the copy number for the 
15 integrated plasmid DNA was found to vary among different 
lines, an anticipated from the calcium phosp.iate co- 
precipitation protocol. Gel-blot hybridization analysis 
was then performed for the cell cultures transfected with 
pCOF-1 and pCONZ. The full-length cDNA clone (a 6.2 kb 
20 Pstl fragment from pBQ6.2) was used as proof to examine 
the CFTR sequence. 

Among the 15 pC0F-l/pSTK7 co-trans fee ted cell lines, 
14 were found to contain at least a portion of the CFTR 
cDNA, as judged by the hybridization pattern of EcoRI- 
25 digested DNA from these lines. Five of these lines shows 
a 1.5 kb and 2.5 kb band, as predicated for an intact 
cDNA. For the remaining lines, the size of the EcoRI 
fragment was different from those predicted, indicating 
rearrangement or integration occurred through this region 
30 of the cDNA. tfpon further analysis with BamHI and Ncol 
digestion, however, only four of the five lines showed 
intact 5 f and 3 f ends of the cONA (see Figure 11). The 
apparently intact clones were ^a-2C, 4a-3I, 4a-3K, and 
4a-4S. 

35 Similar DNA analysis was performed for cell lines 

co- transfected with pCONZ and pSTK7. Among the 10 clones 
examined, nine appeared to contain CFTR sequence and 
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eight of then showed the predicted 1.5 kb and 2.5 kb 
Ecorl fragments (see Figure 12) . Further analysis with 
BamHl and Ncol on five of then revealed that four (6a-lD; 
6a-2F? 6b-J and 6b-K) contained intact 5« and 3« ends. 
5 RNA Analysis 

Expression of CFTR in the transformed mouse cell 
lines was next examined by RNA blot hybridization 
analysis. Two of the pCOF-1 transfected lines (4a-3I and 
4a-3K) were found to express high levels of RNA (see 

10 Figure 12a) but the remaining two lines 4a-ic and««a-4Q) 
were low. in addition to the 6.2 kb band expected for 
correctly initiated and terminated CFTR RNA in these 
lines; however, hybridizing sequences were also detected 
at the 9-10 kb range, suggesting improper initiation or 

15 termination. A similar, high expression pattern was 
observed for three of the lines transfected with pCONZ 
(6a-lA; 6a-lD and 6b-J) (Figure 12a). For clone 6a-2F, 
only a low level of the 6.2 kb species was detected. No 
RNA could be found for clone 6b-K, despite the presence 

20 of an apparent intact CFTR sequence. 

The level of CFTR-derived RNA present varied among 
the cell lines mentioned above, with the strongest 
signals seen in clones 4a-3I and 6b-J. The variation was 
not due to poor sample preparation, as confirmed by 

25 hybridization analysis of the same blot with the TK probe 
(Figure 12B) . 
Protein Analysis 

Total protein was then extracted from several 
representative cell lines and examined for the presence 

30 of. CFTR-related products. The • interpretation of the 
results of this experiment was complicated by the 
presence of cross-reactive materials present in the mouse 
L cell, although no CFTR-related RNA was detectable. 
Figure 13 is an illustration in drawing format of the 

35 Western blot results. These results are based on a 

monoclonal antibody (L12B4) prepared against CFTR which 
is able to detect faint bands (at regions 170, 90, and 74 
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Jed) in all cell extracts. Nevertheless, the intensity of 
the 170 led band, the one expected for the CFTR protein, 
was clearly much stronger in 4a-3I, the cell line 
containing an intact full-length CFTR cDNA. Only 
5 background bands were detected for 2a-4A (containing 

pSTK7 only), 4a-2D (incomplete pCOF-l sequence), and 6b-J 
(pCONZ). The presence of the CFTR protein in 4a-3I was 
also confirmed in partially purified membrane fractions. 
1) A single 170kdalton band was observed in the 
10 extract from the cell line 4a-3I. This cell had been 
demonstrated by DNA analysis to contain multiple copies 
of the intact promoter of the expression vector and of 
the CFTR gene. RNA analysis also indicated expression of 
high levels of mRNA for this clone. 
15 2) In contrast, the cell line 6b* J, generated with 

the mutant gene does not express a reacting 170kdalton 
band. This line generated by co-trans feet ion with the 
pCONZ plasmid was shown to have an intact promoter and 
gene portion. In addition, high levels of RNA were 
20 detected. Identical results for these samples were 
obtained even when the extracts were not subjected to 
heating prior to gel loading. 

3) 170kdalton bands were not observed in the cell 
line 4a-2D generated by the transfection experiment with 
25 the pCOF-l plasmid and the cell line 2a-4A generated by 
the transfection with the pTK7 plasmid only. Genomic DNA 
analysis of the cell line 4a-2D indicated that only 
portions of the CFTR gene had been incorporated. As 
expected, no CFTR RNA could be detected. 
30 In conclusion, the cell line 4a-3I appears to be 

producing a 170Jcdalton CFTR protein as expected from the 
open reading frame of the pCOF-l plasmid. For 
comparison, also shown on the protein analysis figure are 
crude membrane preparations of the cell lines Panc-1 (a 
35 negative control) and T84 (pgoyided by N. Kartner and J. 
Riordan) . Multiple bands care observed in the T84 
preparation. These differences suggest that post- 
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translational modification including glycosylation and/or 
phosphorylations have occurred to the CFTR protein in T84 
cells. 

A CAMP-inducible chloride permeability has been 
5 detected in mouse fibroblast (L cell) lines upon stable 
integration of a full-length cONA encoding the human 
cystic fibrosis transmembrane conductance regulator 
(CFTR) . As indicated by a Cl*-indicator dye, the CI" 
permeability of the plasma membrane increases by 10-to 

10 30-fold within 2 min after treatment of the cells with 
forskolin, an activator of adenylyl cyclase. The 
properties of the conductance are similar to those 
described in secretory epithelial cells; the whole-cell 
current-voltage relationship is linear and there is no 

15 evidence of voltage-dependent deactivation or activation. 
In contrast, this cAMP-dependent CI* flux is undetectable 
in the untransected cells or cells harboring defective 
cDNA constructs, including one with a phenylalanine 
deletion at amino acid position 508 (aP508) , the most 

20 common mutation causing cystic fibrosis. Those 

observations are consistent with the hypothesis that the 
CFTR is a cAMP dependent CI* channel. The availability of 
a heterologous (nonepithelial) cell type expressing the 
CFTR offers an excellent system to understand the basic 

25 mechanisms underlying this CFTR-associated ion 

permeability and to study the structure and function of 
the CFTR. 

We describe the construction and use of a mammalian 
expression vector to produce human CFTR in a long-term 

30 mouse fibroblast culture. We show that expression of 
CFTR induces a cAMP-dependent CI* conductance, which is 
normally not observed in these cells. This expression 
system is suitable for study of the CI* conductance 
pathway and its regulation and to provide medical 

35 treatment for CF. 
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Plasmid Vectors 

The mammalian expression vector pCOF-l is a 
derivative of pSGM3X, which is similar to pSMGl (Meakin 
et al) , except that the human metallothionein Ila 
5 promoter (Karin et al, 1982) was inserted in the opposite 
orientation and a Xho I site was inserted in the Kpn I 
site within the Ecogpt gene. To reconstruct the 

full-length CFTR cDNA in pCOF-l (Fig. 14), the bulk of 
the coding region (exons 2-24) was obtained from partial 

10 cDNA clones (Riordan et al, 1989), except that the three 
silent nucleotide substitutions (T C at position 930, A 
■» 6 at position 933, and T C at position 936) were 
introduced into the exon 6b region with oligonucleotide- 
mediated mutagenesis by the polymerase chain reaction 

15 (19, 20). The 3' untranslated region of CFTR in pCOF-l 
was derived from the genomic DMA clone TE27B2.3 (Rommens 
et al, 1989). The entire coding region of exon 1 (from 
the initiation codon to the Pvu II site) was generated by 
two complementary synthetic oligonucleotides and the 

20 KLenow fragment of DNA polymerase I, where a single 
nucleotide substitution (C -> G) was introduced 
immediately after the initiation oodon (underlined in the 
legend to Fig. 14) to create a Nco I site for ligation to 
the human metallothionein. Ila promoter. The latter 

25 substitution changed the encoded amino acid glutamine to 
glutamic acid. The construction of control plasmld pCONZ 
was similar to that of pCOF-l, except that a single 
nucleotide was deleted 35 base pairs downstream from the 
initiation codon. A truncated protein would be predicted 

30 from this frameshift construct. The plasmid pCOFAF508 
was generated by replacing sequences of exons 9-13 in 
pCOF-l with the corresponding fragment from Cl-1/5, a 
cDNA containing the aF508 mutation. The fu i -length cDNA 
clone pBQ6.2 contained a 6.2 kb Pst I fragment in 

35 pBluescript (Stratagene) and was constructed similarly to 
pCOF-l except that the exon 1 region was derived from 
clone 10-1. The integrity of the CFTR cDNA inserts in 
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PBQ6.2, pCOF-1, and the critical regions in the other 
plasmid constructs were verified by DNA sequencing. The 
plasaid vector (pSTK7) containing the herpes simplex 
virus thymidine kinase gene, used in the cotransfection 
5 experiments, has been described (Meakin et al) . 

Bacterial cell cultures and plasmid DNA samples were 
prepared according to standard procedures (Sambrook et 
al, 1989). 

Cell culture and DMA Transfeotion 

X0 • Each of the three test plasmids (20 /*g) , pCOF-1, • 
pCONZ, and pCOFaF508, was cotransfected with pSTK7 (1 ftq) 
into mouse LTK* cells by calcium phosphate coprecipitation 
(Meakin et al) . Biochemical selection for thymidine 
kinase-positive cell was achieved in minimal essential 

15 medium supplemented with 

hypoxanthine/aminopterin/thymidine (HAT;gibco/brl) . in 
some experiments, the test plasmids were linearized at 
the unique fl£i I site (Fig. 14) . High molecular weight 
DMA was isolated from each clonal cell line (Miller et 

20 al, 1988), digested with restriction enzymes KcoRI. 
fi&aHI, and Hsfi 1/ and analysed by agarose-gel-blot 
hybridization (Sambrook et al, 1989) with the full-length 
cDKA (insert from pBQ6.2) as probe. Total RNA was 
extracted (MacDonald et al, 1987) and analyzed by 

25 agarose-gel-blot hybridization (Sambrook et al, 1989).' 
Protein Analysis 

Cells were homogenized in a hypotonic buffer 
containing 10 mM KC1, 1.5 mM MgCl 2 , and 10 mM Tris-HCl (pH 
7.4). Nucleic and mitochodria were collected by 

30 centrifugation at 4000 x g f or 5 min (fraction A) . A 
crude membrane fraction was then collected by 
centrifugation at 9000 x g for 15 min (fraction B) . 
Membrane pellets were dissolved in loading buffer and 
separated on a NaDodSO^/polvacrylamide (6%) gel (Laemmli, 

35 1970) . Proteins were transferred to nitrocellulose as 
described (Towbin et al, 1979) and immunodetected with 
monoclonal antibody M3A7 (Kartner et al, 1991). 
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6-Methoxy-l- (3-sulf onatopropyl ) quinolinium (8PGQ) 
Fluorescence Assay 

L cells grown on glass coverslips for 1-2 days were 
uniformly loaded with the CI" -indicator dye SPQ by 
5 incubation in hypotonic (1:2 dilution) medium containing 
20 nN SPQ at room temperature for 4 min. The mounted 
coverslip was perfused continuously at room temperature 
with medium containing 138 mN NaCl, 2.4 mM K 2 HP0 4 , 0.8 mM 
KH 2 P0 4 , 10 mM Hepes, 1 mM CaCl 2 , 10 mM glucose, and 10 mM 

10 bumetanide (pH 7.4) on the stage of an inverted 

microscope. NO~ 3 medium was identical except that NO" 
3 replaces all but 10 mN CI". To minimize CI" fluxes 
through nonconductive pathways, we performed the 
experiments in the absence of HCO* 3 and in the presence of 

15 bumetanine, inhibiting the anion exchanger and Cl"/cation 
cotransporters, respectively. Fluorescence and the 
differential interference contrast imagine were performed 
simultaneously (25-27) . SPQ fluorescence intensities (7) 
were normalized to total SPQ fluorescence F 0 , determined 

20 as F measured in the absence of intracellular CI", since 
autofluorescence was negligible, Calibration (n « 7 
cells) were performed essentially as described (Foskett, 
1990) . The effective quenching constant Kct was 15 M" 1 ; 
the testing intracellular CI" concentration was » 70 mM. 

25 Cell volume changes were obtained by planimetry of 

differential interference contrast images (26, 27) and 
are presented as relative changes in the areas of the 
measured optical sections. By exposure of the cells to 
media of various osmolarities, we observed that 

30 differential interference contrast imaging of a single 
optical section can detect volume changes. 
Whole-Cell current Recordings 

Membrane currents were measured at room temperature 
12-24 hr after plating cells with whole-cell patch-clamp 

35 technigues (Hamill et al, 1981). The patch pipet 

contained 110 mM sodium gluconate, 25 mM NaCl, 8 mM MgCl 2/ 
10 mM hepes, 4 mM Na^TP, and 5 mM Na 2 EGTA (pH 7.2) . The 
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bath contained 135 mM NaCl, 2.4 ON KjHPO*, 0.8 aM DH 2 P0<, 3 
mM MgCl 2 , 1 mM CaCl 2 , 10 mM Hepes, and 10 mM glucose (pH 
7.2). To examine the time course of a cAMP- evoked 
conductance changes, membrane potentials were alternately 
5 clamped at -30 and +20 mV for 600 and 400 ms, 

respectively. Current-voltage relationships were 
determined by measuring the currents at the end of 400 - 
ms voltage steps from 0 mV to ± 70 mV (10 mV increments) . 
Cell capacitance was compensated using cancellation 

10 circuitry of the EPC7 patch-clamp amplifier. 
DMA Trans f action 

Stable mouse fibroblast cell lines containing full- 
length CFTR or mutant cDNA, as well as cell lines with 
the pSTK7 plasmid alone, were established. Four cell 

15 lines with pCOF-l (4a-2C, 4a-3I, 4a-3K, and 4a-S) 

contained an intact human metallothionein Ha promoter 
and the CFTR coding region, as judged by DMA analysis 
(data not shown) . Similarly, four cell lines (6a-lD, 6a- 
2F, 6b-l, and 6b-K) for pCONZ control cDNA with the 

20 frame-shift mutation three for dCOFaF508 (5-2C, 5-1A, and 
5-2D) cDNA with the major CF mutation and two for pSTK7 
(2a-4A and 2a-3C) were identified) . 
KM*, analysis 

Abundant levels of RNA transcripts of » 10 kb in 
25 size were detected in cDHA transected cells Fig. 15A. 
Although their size was larger than the anticipated 6.2 
Jcb, the result appeared to be consistent among all cell 
lines. It seemed probable that an alternative 
polyadenylylation site(s) instead of those contained in 
30 the cDNA constructs were utilized. Thymidine kinase- 

specif ic transcripts were detected in all HAT -resistant 
cell lines tested. 
Protein Analysis 

Cells expressing CFTR mRNA (as represented by the 
35 cell line 4a-3l) contained an antibody-reacting protein 
band that was indistinguishable from mature CFTR 
expressed endogenously in membranes of the colonic 
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carcinoma cell line T-84 (Fig. 15B) . The amount of 
protein was within the range of that observed for T-84 
cells, with a significant portion in the light membrane 
fraction. Reacting bands were not detected in 
5 untransfected LTK" cells or in cell transfected with CFTR 
gene predicted to produce a truncated product (line 
6B-I) . The latter result was expected as the antibody 
was directed against the C terminus of the protein. No 
immunoreactive material was observed for CFTRaF508 in the 

10 examined fractions (line 5-2D) (Cheng et al, 1990). It 
was therefore uncertain if the mutated protein was 
produced in these transfected cells. 
SPQ Fluorescence Assay 

To investigate if expression of CFTR affected CI" 

15 conductance, a single-oell assay based on quantitative 
fluorescence intensity measurements of the Cl" indicator 
dye SPQ was performed. The basic protocol involved 
exposing cells to N0~ 3 medium followed by return to normal 
Cl" medium. Since NO"; is generally permeable through Cl" 

20 channels but, unlike Cl", does not quench spq, CHANGES IN 
spq fluorescence intensities due to these anion 
substitutions measure unidirectional Cl" fluxes; the rate 
of change measures cell Cl' permeability. 10 MM forskolin 
was added subsequently to increase intracellular levels 

25 of cANP. After a 2-m* i exposure to forskolin, the medium 
was again switched to the NO~ 5 medium, in the continued 
presence of forskolin. Thus each cell was used as its 
own control to evaluate the Cl" permeability induced by 
cAMP. 

30 Only slow changes in SPQ fluorescence intensity 

were observed in cells with the frame-shift DFTR 
construct (as represented) by line 6B-X) upon exposure to 
N0~ 3 medium (Fig. 16A) , indicating that these cells 
maintained a low resting Cl" permeability. Exposure of 

35 these transfected control cells to forskolin did not have 
any effect (n » 41 cells from four passages) . Similar 
results were obtained from cells transfected with the 
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thymidine kinase gene only (line 2a-4A, n - 22 cells) and 
from untransfected cells (n = 18 cells) . 

Exposure of cells expressing CFTR to N0" 3 medium 
similarly elicited little or no change in SPQ 
5 fluorescence intensity (as represented by line 4a-3I in 
Fig. 16C) , indicating that CFTR expression per se did not 
enhance CI" permeability. In contrast, a second exposure 
to N0" 3 , during forskolin stimulation, caused a rapid loss 
of intracellular CI" (Fig. 16C) . This response was highly 

10 reproducible; there was a 20- to 30-jft>ld increase of 

unidirectional CI" flux (Illsley et al, 1987) from * 0.03 
mM/s to 0.9 mM/s for each of the 10 cells in the 
microscopic field (Fig. 16C) . Of 106 cells examined from 
eight passages, all responded similarly. Similar results 

15 were obtained from another cell line expressing CFTR 

(clone 4a-3K, n - 23 cells) . In the continued presence 
of forskolin (Fig. 16D) , CI* permeability remained 
enhances at near maximal levels for as long as 3Q min (n 
- 12 cells). 

20 There were no volume changes during exposure of 

forakolin-stimulated CFTR cells to N0. 3 , indicating that 
the substantial changes in the intracellular 
concentration of CI' were not associated with changes in 
cell salt content. Therefore, the Cl" fluxes were likely 

25 to be associated with equal fluxes of N0" 3 in the opposite 
'direction. The lack of changes in intracellular Cl" 
concentration or cell volume during forskolin stimulation 
in Cl" medium demonstrates that neither CFTR expression 
nor cAMP conferred enhanced cation conductance. Together 

30 with the enhanced Cl" permeability, cell shrinkage or 

swelling would have been observed if there were a high K* 
conductance or high Na* conductance, respectively. 

Cl" permeability was also examined in cells 
containing a CFTRaF508 construct. These cells (as 

35 represented by clone 5-2D, n - 15 cells) exhibited low Cl" 
permeability under resting conditions and the 
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permeability could not be enhanced by forskolin (Fig. 
16B) . 

To establish that the cAMP-induced CI" permeability 
in the CFTR-expressing cells was due to activation of a 
5 CI" conductance/ gramicidin was included in the normal CI" 
medium to increase cation conductance of the plasma 
membrane. Under these conditions, the presence of CI" 
channel would result in a substantial influx of both Na* 
and CI", causing extensive cell swelling. Exposure to 

10 gramicidin had no effect on SPQ fluorescence or cell 

volume in cells with the frame-shift CFTR construct (n - 
13 cells) or an intact CFTR construct (n « 33 cells) 
(Fig. 16E) , supporting our conclusion (above) that 
resting CI" conductance was negligible in both the control 

15 and CFTR cells. After the addition of forskolin, 

however, there was a marked rapid cell swelling, after a 
lag period of from 30 to 90 s, accompanied by elevated 
intracellular CI" concentration in the cells expressing 
CFTR but not in control cells (Fig. 16E) • These results 

20 demonstrated that the basis of cAMP- induced CI' 

permeability observed in CFTR-expressing cells was a Cl" 
channel • 

oAKF-8ti*ulated Cl" currents in the CFTR-Bxpressing Cells 

Corroborating the results of the fluorescence assay, 
25 whole-cell current was stimulated in cells expressing 
CFTR (Fig. 17A) . After a lag period of * 30 s, outward 
current in 11 of 11 cells increased dramatically to a 
plateau, which was sustained for 2-5 min before 
decreasing toward control values. This "run down w 
30 contrasts with findings using the SPQ assay, possibly 

reflecting a depletion of cytosolic factors necessary for 
sustained activation in the whole-cell patch-clamp 
configuration, currents evoked by the activation 
cocktail did not display any time-dependent voltage 
35 effects (Fig. 17B) contrast to the expressing cells, none 
of the 9 cells containing the frame-shift CFTR construct 
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exhibited a response to the activation cocktail (Fig. 
17A) . 

Current-voltage relations were essentially linear in 
both cell transfected with the intact CPTR and the frame- 
5 shift CPTR constructs (now shown) . Slope conductances of 
1.8 ± 0.4 ns and 1.9 ± 0.5 nS (n - 9) were calculated in 
control cells before and after treatment with the 
activation mixture, respectively. The slope conductance 
in CFTR-expressing cells was similar (1.6 ± o.l nS) but 
10 treatment with the activation mixture induced an « 13- 
fold increase to 20.1 ± 1.7 nS (n - 10) (Fig. 17C) . 

The reversal potential of the cAMP-activated current 
in cells expressing CFTR was -17 ± 5 mV, approaching the 
equilibrium potential for cr under standard conditions 
15 (Ed - -32 mV) . From this measurement, the calculated 
anion versus cation permeability was * 5:1. Anion 
selectivity was assessed in cAMP-activated cells by 
replacing bath NaCl (135 «M) with sodium gluconate (135 
mM) . This manipulation resulted in a shift in reversal 
20 potential to +33 ± 8 mV (n - 4) , toward the predicted Eci 
of +41 rnv (Fig. 17C). 

The inability to construct a full-length CFTR cDNA 
has hampered progress in understanding the structure and 
function of the protein. The difficulty is mainly due to 
25 the instability of the full-length sequence in b. cou, 
where there was sequence rearrangement often associated 
with a short (13 bp) direct repeat in exon 6b. Through 
the modification of DMA sequence within the first copy of 
the repeat, however, we have succeeded in the 
3lf construction of three different plasmids each containing 
the entire coding sequence. These plasmids replicate * 
efficiently in e. com (DH5a) , and are intact after 
prolonged propagation. Although this particular 
modification involves the first repeat, it is understood 
35 that this second repeat can also be similarly modified 
instead of, or in addition to, the first repeat to 
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achieve similar success in the propagation of the CFTR 

cDNA. 

In addition to verification by direct DNA 
sequencing, the plasmids have also been examined for 
5 their ability to product proteins of expected sized in 
vitro and in vivo . Based on the longest open reading 
frame of the consensus cDNA sequence (Riordan et al 
1989), a protein of 170 kd (1480 amino acids) has been 
predicted as the CFTR gene product. The fact that a 170 
10 kd band was detectable in the product translated in vitro 
and in cells transfected with the full-length CFTR cDNA 
confirms the original prediction. 

The availability of a full-length CFTR cDNA that can 
be expressed in mammalian and non-manual ian cells offers 
15 the opportunity to perform a detailed structure and 

function analysis of CFTR. The vectors described here 
are excellent tools for this purpose. With appropriate 
regulatory sequences inserted upstream of the coding 
region, it is understood that large quantities of CFTR 
20 may be produced through different kinds of heterologous 
gene expression systems, whereby various biochemical and 
biophysical studies can be performed. 

The ability: to express the full-length cDNA also 
allows development of functional assays for CFTR. In 
25 this context, Drumm et al (1990) have demonstrated 

through a retrovirus- intermediate that the modified full- 
length cDNA described here was able to confer the 
function of CFTR in a pancreatic carcinoma cell line 
(CPFAC-1) derived from a CF patient. Upon proper 
30 expression of the cDNA, the cAMP-mediated chloride . 
transport activity was restored in this cell line, 
providing the first example of functional complementation 
of CFTR activity. The ability to confer CFTR expression 
in heterologous cells is an important step towards the 
35 possibility of gene therapy in the lung and pancreas of 
CF patients. 
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In order to understand the role of individual amino 
acid residues as well as regions of the protein, site- 
directed mutagenesis may be used to introduce additional 
mutations into the coding region of CFTR. The functional 
5 assay may also be used to confirm if the sequence 
alterations detected in CF patients are bona fide 
disease-causing mutations. The latter consideration is 
important if broad scale disease diagnosis and carrier 
screening based on DNA information are to be implemented. 

10 Further, since there is a general lack of genotypes, 

permanent cell lines in which the CF phenotype resists, . 
the ability to generate heterologous cell lines capable 
of expressing various defective CFTR offers an 
alternative approach in understanding the function of 

15 CFTR and in development of rational therapy. In the 

latter regard, procedures can be devised for screening of 
compounds that would Interact with the defective protein 
and restore its function. 

To introduce additional mutations into the coding 

20 region of CFTR, it is possible to replace regions of the 
cDNA with altered sequences, as demonstrated by the 
examples described above. The procedure is difficult, 
however, because many of the restriction enzyme sites 
involved are present in more than one vector. For this 

25 reason, it is desirable to include unique restriction 

sites in the coding region of the cDNA. For example, by 
introducing a silent change (T to C) at position 1944 in 
exon 13, a novel BstBII site is created at the end of the 
sequence corresponding to the first NBF1 is probably the 

30 most interesting region in CFTR because about one-third 
of the disease-causing mutations reside in the regibn. 
In combination with another unique site, such as SphI, it 
becomes extremely easy to replace sequences for NBF1. 

Finally, the full-length cDNA construct contained in 

35 the pBQ4.7 and pBW6.2 may be excised in its entirety by a 
single PstI digestion or a double digestion with a 
combination of Sail, Xhol, Smal or EcoRI. This 
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versatility allows the cDNA to be transferred from the 
current vector to other host-vector systems. 
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THE EMBODIMENTS OP THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 



1. A modified DNA sequence derived from a gene 
5 coding for cystic fibrosis transmembrane conductance 
regulator (CFTR) protein, said gene having at least 27 
exons of which normal cDNA codes for CFTR protein, said 
normal cDNA including exons 6a and 6b wherein exon 6b 
includes a 13 bp repeat, 

10 said modified DNA sequence comprising *at least one 

of said 13 bp repeats of exon 6b having one or more 
normal nucleotides of said 13 bp repeat substituted with 
an alternate nucleotide which continues to code for a 
corresponding normal amino acid. 

15 2 . A modified cDNA sequence of claim 1 wherein 

said one or more normal nucleotides are at positions 930 
and 933 and said substitutions are C for T at 930 and G 
for A at 933. 

3. A DNA construct for use in a recombinant vector 
20 comprising said modified cDNA of claim 1. 

4. A vector comprising a DNA construct of claim 3, 
and a promoter sequence for said DMA construct. 

5. A vector of claim 4 wherein said vector 
comprises a restriction map of Figure 5 and 8. 

25 6. A host cell for producing CFTR protein, said 

host cell comprising a compatible y*ctor of claim 4 
whereby expression of said vector in said host cell 
produces CFTR protein. 

7. A host cell of claim 6 selected from the group 
30 consisting of bacterial and mammalian cells. 

8. A CFTR protein isolated and purified from 
culture of a host cell of claim 6. 

9. A mammalian cell transf acted with a vector of 
claim 4 to enhance CI' conductance through cell wall. 
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1 AATTGGAAGCAAATGACATCACAGCAGGTCAGAGAAAAAGGGTTGAGCGGCAGGCACCCA 



6 1 GAGTAGTAGGTCTTTGGCATTAGGAGCTTGAGCCCAGACGGCCCTAGCAGGGACCCCAGC 

MORSPLEKASVVS KLF 
121 GCCCGAGAGACCATGCAGAGGTCGCCTCTGGAAAAGXSCCAGCXSTTGTCrCCAAACTTTTT 

FiMTRPILR K G Y R 0 R L B L 8 _P 



181 TTCA< 



^CTGGACCAGACCAACTTTGAGGAAAGGATACAGACAGCGCCTGGAATTGTCAGAC 



16 
36 



TYOIPSVDSADNLSBKLErfE 56 
241 ATATACCAAAT^XXTTCTGTTGATTCTGCTGACAATCTATCTGAAAAATTGGAAAGA 

i* n p F LASK KNPKL I NALRRC 76 
301 TGGGATAGAGAGCTGGCTTCAAAGAAAAATCCTA^ 

r r M »l u p v R T F T v T - R I * V T K A I 96 
361 TTXnCxScMfYm^ 

I ... « i J ri GRI1ASYDPDNKEE 116 

< 21 GTACAGCCTCTCTT^ 

481 CG CTCTATCGCIjATTTATCTAGOCATAGGCCTATGCC^ 

n — n hpaipglhbicmqmrirm ise 

1 It JM - * 1. _* 1 — f ~ — » »TfirXfiATGAGAATAGCTATG 



176 

EXON6. 



541 CTCCTAOICCCAGCCAT 

p s T. ft X X I T L K It 8 8 R V L D K I 8 

601 TTT AGTTTGATTT ATAAGAAGKcnTTAAAGCTCTCAAGCCGTGTTCT 

I GOLVSLL8NNLNKFDE Ig | L A I 
661 ACTCCACAACTTGTTAtfiVJXXTTTCCAAC*^ 

p t t • r>m — 5 « i c c t. e » T. T V T. * T, rl 236 
7B1 GWrTTGTTACAGGCtiTCTG 

841 fexGGCIGGOTAMGAGWT^ 



EftLVlTSEMIEMIOSVXAYC 

G#MUIGACCTGT6M!TACCTC 

3*°" 6- 9J0 933 936 H • 

MKEAMEKMIBMLRdTELKLT 

961 TGO»AaEAG<^TG^ 



276 
296 



RKAAYVRYFH S 1 S ft F F F S g T , ]g_ 3" 



336 
356 



1 02 1 CGGAAGGCAGCCTATGTGAGATACTTCAATAGCTCAGC CTTCTTCTTCTCAGGGTTCTTT 

ltfif gi.ftVT.PYAl.il ^ R T I T. R K 15 
108 1 GTGGTGTTTTTATCTGTGCTTCCCTATGCACTAA 

IpTTTSPCTVt. PMAVl T R 0 F P H 
1141 xTCACCACCATCTCATTCTGCAriGTTCIXX^ATGGCG 

AVQTNYD8LGAXMKI 0 ID FLO 376 

1201 gctctacaWatgctatgactctcttggagc^^ 

KOEYXTLEYMLTTTBVVMEN 396 
1261 AAGCAAGAATATAAGACATTCGAATArAACTTAACGACTACAG^GTAGTGATGGAGAAT 

VTAFWEBlGFGBLFEXAXQMN 416 
1321 GTAACAGCCTTCTGGGAGGAGpGATTTGGGGAACTAXTTGAG^ 

iocttTI JTE SHEET 
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NNRKTSNGDDSLFFSN rgLL 436 
1381 AACAATAGAAAAACTTCTAATGGT6ATGACAGCCTCTTCTTCAGTM1TTCTCACTTCTO 

GTFVLKDINrKIBRCOLLAV 456 
1441 GGTACTCCTGTCCTGAAAGATATTAATTTCTiAGATAGAAAciAGGACAGTTGTTtGCdGTT 

AQSTGAOKlTflLLMKIMOBLB 476 

1501 GCTGGATCCACTGGAGCAGGCAAt^CTrcACTTCTAATGATGATTATGGGAGAACTGGAG 

psecKiKHSORiar ca orgw 496 

1561 CCTTCAGAGGGTAAAATTAAGCACAGTGGAAGAATTTCATTCTGTTCTCAGTTTTCCTCG 

• A 

IMPCTIKEWIirO VaYDKYK 516 

1621 ATTATGCCTGGCACCATTAAAGAAAATATCATCTTTGGTGTTrCCTATGATGAATATAGA 

TRaVIKXCQLg BID I L LLi-L! 536 
1681 TACAGAAGCGT CATC AAAGCATCCCAACTAGAAGA03ACATCTCCAACTTTGCAGAG AAA 

DMIVLCKCOITLgQCQnt^i 556 

1741 GACAATATAGTTCTTGGAGAAOGTGGAATCACACTGAGTGGAGGTCAACGAGCAAGAAn 

a LARIaVYXDADLTLIiP B 9 r a 576 
1801 TCTTTAG<AAGIVGCAGTATACAAAGATGCTCATTTGTATTTATTAGACTCTCCrri*fGGA 

YL DVT.TKK B X F E S] C V C K L MA 596 
1861 TACCTAGATGTTTTAACAGAAAAAGAAATATTTGAAAC|?TGTGTCTGTAAACTGAT<kiC'r 

NKTRXLVTSKMEHLXXADKX 616 
1 92 1 AACAAAACTAGGATTTTGGTCACTTCTAAAfcTCGAACATTrAAACAAAGCTGAC^ 

LILHEGSSYFT GTFSELOKL 636 
1981 TTAATTTTGCATGAAGGTAGCAGCrATTTTTATGGGACATTTTCAGAACTCCAAAATCTA 

QP DFSSXLMGCDSFDQF8AE 656 
2041 CAGCCAGACTTTAGCTCAAAACTCATGGGATGTGATTCTTTOGACCAATTTAGTGCAGAA . 

RRN8 I L TETLHRFS L'BG DA P 676 
2101 AGAAGAAATTCAATCCTAACTGAGACCTTACACCGTT T C TCATTAGAAOGACATGCTCCT 

VSWTETKKQS*FKQTGEFGEK 696 
2161 GTCTCCTGGACAGAAACAAAAAAACAATCTTTTAAACAGACTGGAGAGTTTGGGGAAAAA 

RKnIiLNPINSXRKFS IVOK 716 
2221 AGGAAGAATTCTATTCTCAATCCAATCAACTCTATACGAAAAT"TTTCCATT^ 

TPLQMNGI EBD SDEPLBRRL 736 
2281 ACTCCCTTACAAATGAATGGCATCGAAGAGGATTCTGATGAGCCTTTAGAGAGAAGGCTG 

SLVPDSEQGEAI L P R I SVIS 756 
2341 TCCTTAGTACCAGATTCTGAGCAGGGAGAGGCGATACTGCCTCGCATCAGCGTGATCAGC 

TGPTLQARRRQSVLNLMTBS 776 
2401 ACTGGCCCCACGCTTC*GGCAtt5AAGGAGGCAGTCTGTCCTG^ 

VNQGQN I HRKTTASTRKVSL 796 
2461 G TTAACCAAGGTC AGAACATTCACCGAAAGACAACAGCATCCA CACGAAAAGTGTCACTG 

APQANLTELD IYSRRLSQET 816 
2521 GCCCCTCAGGCAAACTTGACTGAACTGGATATATATTCAAGAAGGTTATCTCAAGAAACT 
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GLEISEEI NEEDLKIecFFDD filfi 
2581 GGCTTGGAAATAAGTGAAGAXATTAACGAAGAAGACTTAAAqSAGTGCCTTTTTCATGAT 

MESIPAVTTWNTYLRY ITVH 856 
2641 ATGGAGAGCATACCAGCAGTGACTACATGGAACACATACCTTCGATATATTACTGTCCAC 

* S Ll f F V L T It C I. VTP T. i r u T~71 8 76 
2701 AAGAGCTTAATTTTTGTGCTAATTTGGTGCTTAGTAATTTTTCTGGCAGAGGTGGCTGCT 

IS L V \0 L WLL G»<TPLOOKgTsI 896 
2761 TCTTTGGTTGTGCTGTGGCTCCTTGGAAACACTCCTCTTCAAGACAAAGGGAATAGTACT 

V 

HSRNN5YAVI ITSTS IS Y V V Tl 91* 
2821 CATAGTAGAMTAACAGCTATGCAGTGATTATCACCAGCACCAGTTCGTATTATGTGTTT 

ly I YVGVADTL LAMGFFl R 6 L P 936 
2881 TACATTTACGTGGGAGTAGCCGACACTTTGCTTGCTATGGGATTCTTCAGAGGTCTACCA 

LVHTLITVSKXLBHXMLBSV 956 
2 94 1 CTGGTGCATACTCTAATCACAGTGTCGAAAATTTTACACCACAAAATGTTACATTCTGTT 

LOAPHSTLMTLKAfcGILHRF 976 
3001 CTTCAAGCACCTATGTCAACXCTCAACACGTTGAAAGCAGmTGGGATTCTTAATAGATTC 

8KDIAILDDLLPL T I I F D F I ol 996 

3061 TCCAAAGATATAGCAATTTTGGATGACCTTCTGCCTCTTACCA'I Xl^ltJujVrCAl^CCAQ 

3121 ^^^^^^^^^^^g^c^sg 1016 

3181 ^^^^^G^AG^^A^G^^S^^^^^G^^^^^SwJc 1036 

SQOLKQLBSBGRSP IFTHLV 1056 
3241 TCACAGCAACTCAAACAACTGGAATCTGAAGGCAGGAGTCCAATTTTCACTCATCTTGTT 

TSLKCLWTLRAPCROPYrBT 1076 

3301 ACAAGCTTAAAACGACTATGGACACTTCGTGOCTTCGGACGGCAGCCTTACTTTCAAACT 

L F H X A L N L R T A HHP L ¥ I. s T r iak 
3361 CTGTTCCACAAAGCTCTGAATTTACATACTGCCAACTGGTTCTTGTACCTGTCA 

R N F 0 H R I I KM 1 r v j p p t * m i» d %\\c 

I 1 S I L T T ttl E G E G H V C T ~T t T i Jl llas 

3«1 ATTTCCATTTTAACAACA<fcAGAA(X^GAAG^ 1136 

IMWIMST T. ftWA V^YTI I D V D S Ll 1156^ 
3541 ATGAATATCATGAGTACATTGCAGTGGGCTG7AAACTCCAGCATAGATGTGGATAGCTTI* 

MRSVSRVFKFIDMPTEGKPT 1176 
3601 ATGCGATCTOTGAGCCGAGTCTTTAAGTTCATTGACATGCCAACAGAAGGTAAACCTArc 

KSTKPYKNGQLSKVHZXENS 1196 
3661 AAGTCAACCAAACCATACAAGAATGGCCAACTCTCGAAAGTTATGATTATTGAGAATTCA 

RVKXDD IW PSGGOMTVKDLT 1216 
3721 ^CGTGAAGAAAGATGACATCTGGCCCTCAGGGGGCCAAATGACTGTCAAAGATCTCA^ 

A K TTKOOHAILKWIgy 8 I B V 1236 
3781 GCAAMTACACACAAG(JTGGAAATGCCATATTAGAGAACATTTCCTTC 

gORiVGLLGR TCSCKgTLLgA 1256 
3841 GGCCAGAGGCTGGGCCTCTTGGGAAGAACTGGATCAGGGAAGAGTACTTTGT 

RI IRSTTCUTE. SHEET 
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P I» R L LHTEQEf oiDflvgy pg 1276 

3901 TTTTTGAGACTACTGAACACTGAAGGAGAAATCCAGATCGATGGTGTGTCTTGGGATTCA 

ITLQQWR KArcviPQKVyir 1296 
3961 ATAACTTTGCAACAGTGGAGGAAAGCCTTTGGAGTGATAC^CAGAAAGTATTCATTTCT 
• 

-S QTrRlCKLPPT BQWgDQKTW 1316 

4021 TCrGGAACATTTAGAAAAAACTTCGATCCCTATGAACACTO^CTGA 

K v A D l Mr QLRav iBorpoici.D 1336 

4081 AAAGTTGCAGATGAGtSTTGCGCTCAGATCTGTCATACAACAGTTTCCTGGGAAGCTTCAC 

I ▼ Jl- V DCO CVL8HCHKQLMCL 1356 
4141 TTTGTCCTTCTGGATGGGGGCTGTGTCCTAAGCCAIGG(XACAAGCAGTTGATGTG 

B 8VL 8KAKILLLDBPaAHI. 1376 

4201 GCTAGATCTGTTCTC»CTAAGGCGAAGATCTT<KTGCr^ 

„ # . PP. MTTQIIRll TLKOAPADCT 1396 
4261 GATCCAGI^CATACCAAATAATTAGAAGAACTCTAAAACAAGCATTTGCTGATTCCACA 

VILCERRIEAHLECOQFLlVI 1416 
4321 CTAATrCTCTOr<^CACA<^TAaUUX^TGCT^ 

EENKVRQYDSIQKLLNBRS L 1436 
4381 GAAGAGAACAAAGTGCGGCAGTACGATTCCATCCAGAAACTGCTGAACGAGAGGAGCCTC 

FRQAISPSDRVKLFPBRNSS 1456 
4441 TTOCGGCAAGCCATCAGCCCCTCCGACAGOGTCAACC r CTTTCCCCACOGGAACTCAAGC 

KCK8KPQIAALKEBTEEEVQ 1476 
4501 AAGTGCAAGTCTAAGCCCCAGATTGCTGCTCTGAAAGAGGAGACAGAAGAAGAGGTGCAA 

D T R L • 24g 0 
4561 GATACAA^GCTTTAGAGAGCAGCATAAATGTTGACATGGGACATTTGCTCATGGAATTGG 

tii} ^SJJS^^Jcctcaiggaa^^ 

4681 AAAACAAQGATQAATTAA GTT TT TTT T TA AAAAAGAAACATTTGGTAAGCGGAATTXSAGG 

A^TCATATGCCT<ri^TAA ATGGCTTCC^^ 

4801 TTCAAATOCTTOAAGATTTAOCA CTTGTGT 1TTGCAAGCCAGATTTTCCTGAAAACCCTT 

ill} GCCATGTGCTACTAATTt^SAAAGGCAGCTCTAAATGTCAATCAGCCT 

4921 AZ TGTCTAGTGAAACTCGTTAATTTGTAGTGTTGGAGAAGAACTGAAATCATACTTCTTA 

4981 GGGTTATGATTAAGTAATGATAACTGGAAACTTCAGCGGTTTATATAAGCTTGTATTCCT 

3041 TTTTCTCTCCTCTCCCCATGATGTTTAGAAACACAACTATATTGTTTGCTAAGCATTCCA 

5101 ACTATCTCATTTCCAAGCAAGTATTAGAATACCACAGGAACCACAAGACTGCACATCAAA 

5161 ATATGCCCCATTCAACATCTAGTGAGCAGTCAGGAAAGAGAACTTCCAGATCCTGGAAAT 

5221 QAGGGTTAGTATTGTCCAGGTCTAOCAAAAATCTCAATATTTCAGATAATCACAATACAT 

5281 CCCTTACCTGGGAAAGGGCTGTTATAATCTTTCACAGGGGACAGGATGGTTCCCTTGATC 

5341 AAGAAGTTGATATGCCTTTTt^CAACTCCAGAAAGTGACAAGCTCACAGACCTTTGAACT 

3401 AGAGTTTA€CTG<^IAAAGTATGTTAG1TCAAATTGTCACAGGACAGCCCTTCTTTCCACA 

5461 GAAGCTCCAGGTAGAGGGTGTGTAAGTAGATAGGCCATGGGCACTGTGGGTAGACACACA 

5521 TGAAGTCCAAGCATTTAGATCTATAGGTIGATCGTGGTATGTTTTCAGGCTAGATGTATG 

5581 TACTTCATGCTGTCTACACTAAGAGAGAATGAGAGACACACTGAAGAAGCACCAATCAT6 

5641 AATTAGTTTTATATGCTTCTGTTTTATAATWTGTGAAGCAAAATTTTTTCTCTAGGAAA 

5701 TATTTATTTTAATAATGTTTCAAACATATATTACAATGCTGTATITTAAAAGAATGATTA 

5761 TG AATTA CATTTGTATAAAATAATTTTTATATTTGAAATATTGACTTTTTATGCCACTAC 

5821 TATTTTTATGAAATATTATGTTAAAACTGGGACAGGGGAGAAOCTAGGGTGATATTAACC 

5881 AGGGGCCATGAATCACCTTTTGGTCTGGAGGGAAGCCTTGGGGCTGATCGAGTTGTTGCC 

5*1} CACAGCTGTATGATTCCCACCCACACACAGCCICTTAGATGCA^TTCTGAAGAAGATGCT 

6001 ACCACCAGTCTGACTCTTTCCATCAAGGGTACACTGCCTTCTCAACTCCAAACTGACTCT 

^^p^^CTGCATTATATTTATTACTGTAAGAAAATATCACTTGTCAATAAAATCCATA 
4121 CATTTGTGT (A) n 
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